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Abstract. We propose a way of associating to eacli finitely generated 
monoid or semigroup a formal language, called its loop problem. In the 
case of a group, the loop problem is essentially the same as the word 
problem in the sense of combinatorial group theory. Like the word prob- 
lem for groups, the loop problem is regular if and only if the monoid is 
finite. We study also the case in which the loop problem is context-free, 
showing that a celebrated group-theoretic result of MuUer and Schupp 
extends to describe completely simple semigroups with context-free loop 
problems. We consider also right cancellative monoids, establishing con- 
nections between the loop problem and the structural theory of these 
semigroups by showing that the syntactic monoid of the loop problem 
is the inverse hull of the monoid. 



1. Introduction 

One of the most productive and successful approaches to finitely generated 
groups is the study of the language of all words over a given finite generating 
set which represent the identity element. Many of the structural properties 
of a group are reflected in the language-theoretic properties of this language. 
For example, the language is regular, context-free, recursive or recursively 
enumerable exactly if the group is finite, virtually free, embeddable in a 
simple subgroup of a finitely presented group, or embeddable in a finitely 
presented group respectively [H [22l IH ITij. 

In a group, words u and v over the generators represent the same element 
exactly if uv~^ represents the identity. It follows that deciding membership 
for the language of words representing the identity is algorithmically equiv- 
alent to the word problem in the sense of universal algebra ^1 Section 30], 
that is, the problem of deciding whether two given words in the generators 
represent the same element of the group. In particular, the language en- 
codes the multiplication, and hence the entire structure, of the group. For 
this reason, the language of words representing the identity is usually termed 
the word problem of the group. 

By contrast, in a more general finitely generated monoid, there is no 
equivalence between the language of words representing the identity, and 
the universal algebraic word problem. One need only consider the class of 
monoids obtained by adjoining identities to semigroups, to see that radi- 
cally different monoids can give rise to the same language. It is perhaps 
for this reason that recent research in combinatorial semigroup theory has, 
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with a few exceptions [71 TU' , eschewed direct interaction with formal lan- 
guage theory. Although automata appear with increasing frequency (as in 
the emerging theory of automatic semigroups |171 EI ) , they are typically of 
a "multi-tape" form which does not admit a direct language-theoretic inter- 
pretation. However, in view of the numerous and deep interactions between 
combinatorial group theory and formal languages, and also between finite 
semigroup theory and formal languages (see, for example, [23] ), it would 
be surprising if the theory of formal languages had no role to play in the 
combinatorial theory of finitely generated semigroups and monoids. 

The purpose of this article is to propose and begin the study of a new 
way of associating to each finitely generated monoid or semigroup a formal 
language, which we call the loop problem. Like the word problem for groups, 
this language encapsulates the entire structure of the semigroup, at least in 
the simplistic sense that it uniquely determines the semigroup. A less trivial 
question is that of whether there is a correspondence between "natural" 
structural properties of a semigroup or monoid, and the "natural" language- 
theoretic properties of its loop problem, and it is this question in which we 
are primarily interested. 

In addition to this introduction, this paper comprises six sections. In 
Section |2 we briefiy revise some elementary automata theory. Section El 
introduces the loop problem of a finitely generated monoid or semigroup, 
while Section |1] establishes some of its basic properties, including the extent 
to which it is invariant under change of generators. 

SectionElconsiders the case in which the monoid is a group, describing the 
precise relationship between the loop problem and the word problem. More 
generally, we study the important class of finitely generated completely sim- 
ple semigroups, establishing a natural relationship between the loop prob- 
lems of such semigroups and the loop problems (and word problems) of their 
maximal subgroups. 

Section El begins the study of the interaction between natural structural 
properties of semigroups, and language-theoretic properties of their loop 
problems. We begin with a semigroup-theoretic analogue of a basic result 
of Anisimov T from combinatorial group theory — a monoid is finite if 
and only if its loop problem is regular. We consider also the classes of 
context-free languages; we apply a result from Section El to show that a 
celebrated theorem of Muller and Schupp 22 from combinatorial group 
theory extends to characterise completely simple semigroups with context- 
free loop problems, and ask what can be said more generally about the class 
of semigroups with this property. 

Finally, in Sectional we consider the loop problem in the important spe- 
cial case of right cancellative monoids. It transpires that there is a simple 
and elegant relationship between the between the loop problem and the 
established structural theory of these monoids (HI EH]. In particular, the 
syntactic monoid of the loop problem is the inverse hull of the monoid. 

2. Automata 

In this paper, a central role will be played by infinite automata over fi- 
nite alphabets. It is well-known, if not so well documented, that much of 



THE LOOP PROBLEM 



3 



the classical theory of finite automata extends easily to the infinite case, 
provided sufficient care is taken with the definitions. In this section, we 
briefiy introduce some fundamental definitions and results of automata the- 
ory, partly so as to make the paper accessible to the reader not familiar with 
classical automata theory, and partly to establish the necessary foundations 
in the infinite case. Proofs are not given; they can be found in the finite 
case in the standard textbooks on formal language theory |15|l2,Sj. while the 
appropriate adjustments for the infinite case are left as an exercise for the 
reader. 

Let X be a finite set of symbols or letters, called an alphabet. A word over 
X is a finite sequence of letters from X; the empty word, with no symbols, is 
denoted e. The collections of all words over X forms a monoid (with identity 
e) under the operation of concatenation; it is called the free monoid on X 
and denoted X*. The non-empty words comprise a subsemigroup of X*, 
called the free semigroup on X and denoted X~^ . It is readily verified that 
free monoids and semigroups satisfy the usual universal property defining 
free objects in the categories of monoids and semigroups respectively. A 
language over X is a collection of words over X, that is, a subset of the free 
monoid X* . 

An automaton over X consists of a directed graph with edges labelled by 
elements of X* , with a distinguished start vertex and a set of distinguished 
terminal vertices. The vertices and edges of an automaton are sometimes 
called states and transitions respectively. The labelling of edges extends 
naturally, via the multiplication in X* , to a labelling of paths by words in 
X* . The language accepted by the automaton is the set of all words in X* 
which label paths from the initial vertex to a terminal vertex. 

The automaton is called finite if the vertex and edge sets are finite; a 
language accepted by a finite automaton is called a regular language. Many 
textbooks on automata focus exclusively on the finite/regular case, and use 
the terms "automaton" and "finite automaton" interchangeable. In this 
article, we are equally interested in the infinite case, and an "automaton" 
may be infinite unless explicitly stated otherwise. 

An automaton is called deterministic if its edges are labelled by elements 
of X, and for each state q and each letter x (z X, there is at most one 
edge leaving q with label x. An automaton is called trim if for every vertex 
q, there is a path from the initial state to q, and a path from q to some 
terminal state. Two automata are isomorphic if there is an isomorphism 
between their underlying graphs which preserves edge labels, the start state 
and the set of terminal states. 

Let Q be the state set of the automaton. For each word w G X* , one can 
define a binary relation 

(^w = {(Pi q) I there is a path from p to q labelled w} Q x Q. 

The set of all relations of the form aw constitutes a subsemigroup of the 
monoid of all binary relations on Q; the multiplication is given by au(Tv = 
auv The relation cr^ is easily seen to be act as an identity for this semi- 
group (even though it may not be the identity function on Q), so in fact the 
semigroup is a monoid, called the transition monoid of the automaton. In 
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the case that the automaton is deterministic, the relations are all par- 
tial functions, is actually the identity function on Q, and the transition 
monoid is generated by the maps ax for letters x ^ X. 

The cone of a vertex q is the language of all words which label paths from 
q to some terminal vertex. Two vertices are called equivalent if they have 
the same cone. 

Proposition 2.1. Let L he a language. Then there exists a unique (up 
to isomorphism) deterministic trim automaton M{L) accepting L, with the 
property that no two vertices are equivalent. 

The automaton given by Proposition 12 . II is called the minimal automaton 
of the language L. It is a final object in the category of all deterministic 
trim automata accepting L with an appropriate notion of morphism. An 
easy corollary is that L is regular exactly if M{L) is finite. Moreover, if L 
is regular then M{L) has strictly fewer states than any other deterministic 
automaton accepting L; many authors concerned only with finite automata 
take this property as the definition of the minimal automaton, although 
even in the finite case the morphism viewpoint is arguably more helpful for 
understanding the theory. In the infinite case, of course, the cardinality of 
the state set is a wholly insufficient to characterize the minimal automaton. 

Let L be a language over an alphabet X. We say that two words u,v & X* 
are syntactically equivalent (with respect to L), and write u =l v, if for every 
pair of words x,y (z L, we have xuy S L if and only if xvy E L. Thus, two 
words are syntactically equivalent if one can substitute one for the other in 
any word, without affecting membership of the language L. We write [u]l 
for the syntactic equivalence class of a word u. It is readily verified that 
the relation of syntactic equivalence is a congruence on the free monoid X*, 
so that the syntactic equivalence classes form a monoid with multiplication 
well-defined by Ml^Il = [uv]l. This is called the syntactic monoid of L 
and denoted S{L). The following proposition relates syntactic monoids to 
minimal automata. 

Proposition 2.2. Let L be a language. Then the syntactic monoid S{L) 
is isomorphic to the transition monoid of the minimal automaton M{L) via 
the map [u\l i— cr^. 

We shall also need the notion of a rational transduction. We provide 
here a brief definition; for a detailed introduction, see [Hj. Let X and Y be 
finite alphabets. A finite transducer from X to y is a finite directed graph 
with edges labelled by elements of the direct product X* x Y*, and with a 
distinguished initial vertex and a set of distinguished terminal vertices. Just 
as with automata, the labelling of edges extends, via the multiplication in 
the direct product monoid X* x y*, to a labelling of paths. The relation 
accepted by the automaton is the set of all pairs in X* x Y* which label paths 
from the initial vertex to a terminal vertex. A relation between free monoids 
X and Y recognised by a finite transducer is called a rational transduction 
from X to Y. 

If p C X* X y* is a relation and L C X* is a language, then the image of 
L under p is the language 

Lp = {v I (u, f) G p for some u € L} C Y*. 
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If /? is a rational transduction then we say that Lp is a rational transduc- 
tion of L (via p). Two languages are called rationally equivalent if each is 
a rational transduction of the other; rational equivalence is, as the name 
implies, an equivalence relation on the class of all languages. The inverse of 
p is the relation 

p-i = {(n,v) I {v,u) G p} C y* X X*. 

It is easily seen that if p is a rational transduction then so is p~^. 

We shall also need another operation on languages. If L C X* is a 
language then the Kleene closure of L is the submonoid of X* generated by 
L, that is, the set of all words of the form wi . . . Wn for some integer n > 
and words wi, . . .Wn € L. The Kleene closure of L is usually denoted L*; 
note that if L = X is in fact the alphabet viewed as a subset of the free 
monoid it generates, then the Kleene closure of L is the entire free monoid 
on X, so the notation X* is unambiguous. 

3. The Loop Problem 

In this section, we introduce the languages which are the main object of 
study in this paper. We assume familiarity with the basic terminology and 
notation of semigroup theory [T^ and combinatorial group theory 20 , 
We define the loop problem first for monoids, and then more generally for 
semigroups. 

Let M be a monoid. By a choice of (monoid) generators for M we mean 
a surjective morphism a : X* — > M from a free monoid onto M. The choice 
of generators is called finite if X is finite. A monoid which admits a finite 
choice of generators is called finitely generated. Since our aim is to connect 
semigroup theory with the theory of formal languages (over finite alphabets) 
we are primarily interested in finitely generated monoids, although many of 
our results will also hold for infinite choices of generators. 

Recall that the (right) Cayley graph Tfj[M) of M with respect to a is a 
directed graph, possibly with multiple edges and loops, with edges labelled 
by elements of X. Its vertices are the elements of M, and it has an edge 
from a G M to 6 E M labelled x G X exactly if a{xa) = 6 in the monoid M. 

Now let X = {x I X G X} be a set of formal inverses for the generators 
in X, and let X = X yj X. We extend the map x ^ x to an involution 
on X* by defining x = x for all x G X, and xi . . . x„ = x^. . .xj for all 
xi, . . . ,x„ G X. (In fact, X* together with the unary operation x ^ x is 
the free monoid with involution on X ^19. Section 6.1].) 

The (right) loop automaton Tcj{M) of M with respect to X is obtained 
from the Cayley graph T„{M) by adding for each edge labelled x an inverse 
edge, in the opposite direction, labelled x. Notice that for each path tt in 
the loop automaton labelled w G X*, there is a corresponding path in the 
opposite direction labelled w; we denote this path vf. We view the loop 
automaton as a (typically infinite) automaton over X, with start state and 
terminal state the identity of M. The (right) loop problem of M with respect 
to a is the language L^{M) C X* of words recognised by the loop automaton 
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Words in X* and X are called positive and negative words respectively; 
words in X* which are neither positive nor negative are called mixed. Simi- 
larly, an edge or path in Ta{M) is called positive [negative, mixed] if it has 
a positive [respectively negative, mixed] label. 

We now introduce the corresponding definitions for more general semi- 
groups, possibly without identity. Suppose 5 is a semigroup. By a choice of 
(semigroup) generators for S we mean a surjective morphism a : X^ — > S 
from a free semigroup onto S; again, a is called finite if X is finite. Let 
be the semigroup obtained from S by adjoining an identity element, denoted 
1, even if there already is one. Then a extends uniquely to a monoid choice 
of generators : X* S^. We define the loop automaton T^iS) of S with 
respect to a to be the loop automaton ro-i(5^). 

Note that if M is a monoid, then it admits choices of generators, and hence 
Cayley graphs and loop problems, both as a semigroup and as a monoid. We 
shall see below fProposition 14. 7|) that there is a close relationship between 
the loop problems of M as a monoid and as a semigroup. 

For brevity, we introduce a few notational conventions. If X is a semi- 
group [respectively, monoid] generating subset for a semigroup [monoid] S, 
then the inclusion map X ^ S extends naturally to a formal choice of gen- 
erators a : X~^ — > 5 [c7 : X* ^5]. In this case, we write Tx{S), Tx{S) and 
Lx{S) for T(j{S), Ta{S) and L(^{S) respectively. Moreover, where the choice 
of generators is clear, we shall omit the subscript entirely, writing simply 
r(S'), r(5) and L{S). Similarly, if only one monoid or semigroup is under 
consideration, we simply write To-, To- and Lo-, or even just F, F and L. 

There is of course a dual notion of the left Cayley graph of a monoid or 
semigroup with respect to a given generating set, obtained by considering 
multiplication by generators on the left. Consequently, one obtains also the 
left loop automaton (which reads input words from right to left) and the 
left loop problem. In this paper, we shall restrict attention to right loop 
problems. The loss of generality through this restriction is limited by the 
fact that the left loop problem of M is simply the reverse of the right loop 
problem of the opposite monoid of M. 

In fact the loop problem is one of two closely related languages which can 
naturally be associated to a finitely generated monoid or semigroup. The 
other, which we might call the meeting problem, is simply the set of all words 
of the form uv such that u and v are words over the generators representing 
the same element of the monoid or semigroup. The meeting problem is 
clearly the intersection of the loop problem with the regular language X*X ; 
in contrast, the loop problem cannot be described so easily in terms of the 
meeting problem. Of the two, then, the loop problem is the more complex; 
it is more naturally analogous to the word problem of a group, and better 
describes the geometry of the Cayley graph of the monoid; the meeting 
problem is typically easier to work with and more closely related to the word 
problem in the sense of universal algebra, but is less geometric and does not 
directly generalise the usual notion of a word problem (as a language) for 
groups. One might reasonably expect that a language-theoretic restriction 
on the loop problem will place much stronger conditions on a semigroup 
than the same restriction applied to the meeting problem. 
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4. Basic Properties 

In this section we describe some basic properties of the loop problem. The 
broad objective is to show that finitely generated monoids and semigroups 
which are "similar" (that is, which enjoy similar structure) give rise to lan- 
guages which are "similar" (that is, belong to the same classes of languages, 
from amongst the classes most frequently arising in formal language theory) . 
The technique is to show that when one has two finitely generated monoids 
or semigroups which are closely related, one can often pass between the 
corresponding languages using simple language-theoretic operations such as 
morphisms, inverse morphisms, rational transductions and Kleene closure. 

The following lemma, which follows easily from the definitions, describes 
the relationship between paths in the loop automaton L[M) and multipli- 
cation in the monoid M. 

Lemma 4.1 (The Zig Zag Lemma). Let a : X* ^ M be a choice of 
generators for a monoid M, and let x,y € M. Let n > 1 and suppose 
uq, . . . Un~i, vi, . . .Vn G X* . Then the following are equivalent: 

(i) T(j{M) has a path from x to y labelled uoViUiV2 ■ ■ ■ Un-iv^; 

(ii) there exist po, . . . ,pn & M such that x = po, y = pn, and pi{uia) = 
Pi+i{vi+ia) forO <i <n. 

Proof. Suppose first that (i) holds, and let tt be a path from x to y labelled 
U0V1U1V2 ■ ■ ■ Un-iv^- Let Po = 1, and for 1 < i < n, define pi be to be the 
vertex reached by while tracing the path vr from x, after reading the prefix 
uqVi . . . Ui^iVi. That these vertices have the required property follows easily 
from the definitions. 

Conversely, if (ii) holds then, since the loop automaton contains the Cay- 
ley graph, there is a path tTj from each pi to pi{uia) labelled Ui and a 
path (pi+i from each pj+i to pi+i{vi^ia) = pi{uia) labelled fj+i. In the 
loop automaton, each path (j)iJ^i has an inverse path which runs from 
Pi{uia) = pi+i{vi+ia) to pi+i and has label Vi. Now the concatenation 

VTo^lVTi . . . TTn-lfpn 

is clearly a path with the properties required to show that (i) holds. □ 

Since every word in X* can be factored in the form uqViUiV2 ■ ■ ■ Un-iv^ for 
some words uq, . . . Un~i,vi, . . .Vn € X* , the Zig Zag Lemma gives a complete 
algebraic description of all the path labels between any two vertices. The 
special case of the Lemma in which n = 1 and x = y = 1 shows that 
the loop problem of a monoid encodes the word problem in the sense of 
universal algebra Section 30]. Substituting for M in the statement 
gives a corresponding result for semigroups. 

Corollary 4.2. Let a : X^ S [respectively, a : X* M] be a choice 
of generators for a semigroup [monoid] M , and let u,v £ X^ [u, v € X*]. 
Then ua = va if and only if uv Lu. 

From this, we see immediately that, unlike the sets of words representing 
the identity, the loop problem is sufficient to determine up to isomorphism 
the monoid or semigroup, together with its generating system. 
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Corollary 4.3. Let X be a finite alphabet, and Mi and M2 be monoids 
with choices of generators ai : X* Mi and 0"2 : X* — > M2. If L(j^{Mi) = 
L„^[M2) then there is a monoid isomorphism p : Mi — > M2 such that aip = 
£72. The corresponding statement for semigroups also holds. 

Proof. Suppose L„^{Mi) = L„^{M2). Define p : Mi ^ M2 by {wai)p = 
wa2. This map is well-defined since if uai = vai then by Corollary 14.21 we 
have uv € Lo-i(Mi); but then uv € Lcj^{M2) so by Corollary 14.21 again we 
haye u(T2 = va2. An entirely similar argument establishes that p is injectiye. 
To see that p is a monoid homomorphism, we observe that 

[{uai){vai)]p = {{uv)ai)p = {uv)(j2 = {ua2){va2) = [{uai)p][{vai)p]. 

Surjectiyity of p is immediate from surjectivity of a2. Finally, that aip = (T2 
follows straight from the definition, completing the result for monoids. 

The statement for semigroups follows directly from the definition and the 
result for monoids, with substituted for M. □ 

Lemma 4.4. Let a : X* M be a choice of generators for a monoid M , 
and suppose w £ L„{M). Then w labels a loop at every vertex in T(j{M). 
The corresponding statement for semigroups also holds. 

Proof. Let x e M. Write w = uqViUiV2 ■ ■ -Un-iv^ with each G X* . 

That w G LfjM means exactly that vu € X* labels a loop at 1 in ro-(M). 
By the Zig Zag Lemma fLemma 14. there exist pq, . . . ,pn G M such that 
Po = Pn = ^ and pi{uia) = pi^i{vi+ia) for < i < n. For each i, let 
Qi = xpi. Then we have Po = Pi = x, and for each i, qi{uia) = xpi{uia) = 
j+io") = Now applying the Zig Zag Lemma again, we 

obtain the required loop at x labelled w. 

The corresponding result for semigroups is once again obtained by sub- 
stituting 5^ for M, and applying the definition of the loop automaton of a 
semigroup. □ 

We apply Lemma 14.41 to prove a simple language-theoretic property of 
the loop problem. Recall that a language L is insertion-closed if whenever 
x,y,w are (possibly empty) words such that xy £ L and w £ L we have 
xwy E L. 

Proposition 4.5. Every loop problem is insertion-closed. 

Proof. By definition, every loop problem of a semigroup 5 is also a loop 
problem of the monoid with respect to a choice of monoid generators, so 
it suffices to consider the loop problem L„[M) of a monoid M with respect 
to some choice of monoid generators a : X* — s- M. Suppose xy E L^j^M) 
and w E Lfj(M). Then Tfj(M) has a vertex v such that there is a path from 
1 to V labelled x and a path from w to 1 labelled y. Now by Lemma 14.41 
there is a loop at v labelled w. Hence, there is a path from 1 to 1 labelled 
xwy so xwy E La-{M). Thus, La-{M) is insertion-closed, as required. □ 

It is well-known that the word problems of a group with respect to differ- 
ent finite choices of generators can be obtained from each other as inverse 
images under morphisms of free semigroups. Many important classes of lan- 
guages are closed under inverse morphism, and it follows that these classes 
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correspond to invariant properties of groups which do not depend upon the 
finite choice of generators. It transpires that the loop problems of monoids 
and semigroups enjoy a similar property. 

Proposition 4.6. Let M be a monoid and a : X* ^ M and t : Y* ^ M 

be finite choices of generators. Then L„{M) is an inverse morphic image of 
Lr{M). The corresponding statement for semigroups also holds. 

Proof. For each generator y € y, let Wy be a word in X* such that Wya = xt. 
Define a morphism p : Y* ^ X* by yp = Wy and yp = for all y (z Y. 
Note that upa = ut for every u (z Y* , and vp = Up for every v £ Y*. We 
claim that L^(M) = L^(M)p~'^. 
Suppose w &Y*. Write 

where uq, ■ • ■ Un-i,vo, . . . ,Vn G Y*. Then w £ Lt-(M) if and only if the loop 
automaton Tt-{M) has a path from 1 to 1 labelled w. By the Zig Zag Lemma 
(Lemma l4.1|) . this is true exactly if there exist elements po, . . . ,pn £ M such 
that po = Pn = I and piiuiT) = p^+l(^;^+lr) for < i < n. But this holds 
exactly if there exist elements pQ, . . . ,pn G M such that po = pn = 1 and 
Pi{uipa) = pi+i(t'i+ipfT) for < i < n. Now by the Zig Zag Lemma again, 
this is true if and only if 

wp = {uQp){vlp){uip){v^p) . . . {un-ip){n^p) 

= {uop){vTp){uip){v^) . . . {un-ip){n^) 

labels a loop at 1 in ro-(M), that is, lies in L„{M). 

For the semigroup case, if a : X~^ S and r : S are semigroup 

choices of generators, then : X~^ and : Y* are monoid 

choices of generators for the monoid S^, so arguing as above, we obtain a 
morphism p : Y* ^ X* such that 

U{S) = L,i{S^) = L^i{S^)p-' = L^{S) 

as required. □ 

Proposition 14.61 tells us that, for example, the classes of regular and 
context-free languages can be expected to correspond to invariant properties 
of finitely generated monoids. It is not, however, immediately obvious that 
these properties will be natural ones from a semigroup-theoretic perspective. 
In Section El below, we shall attempt to isolate the properties in question. 

Suppose M is a monoid. We remarked in Section |1] that M admits loop 
problems both as a semigroup and a monoid. Proposition 14.61 ensures that 
the different loop problems for M as a monoid are closely related, as are 
those for M as a semigroup. It does not, however, guarantee a relationship 
between the loop problems with respect to monoid generating sets, and those 
with respect to semigroup generating sets. The following proposition shows 
that there is nonetheless a close relationship. 

Proposition 4.7. Let M be a monoid, let a : X^ M be a semigroup 
choice of generators for M , and let r : X* M be the unique extension of 
a to a monoid choice of generators for M . Then 

L„{M) = Lr{M) n {XX* X U {e}). 
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and there exists a word w € such that 

Lr{M) = w-^L^{M)w-^ = {ueX* \ www G La{M)}. 

Proof. Since by definition La-{M) = L^i{M^), we need to consider both 
the identity of M and the extra identity adjoined in M^; we write e for the 
identity of M, and 1 for the identity of Observe that t„{M) = t^i (M^) 
is isomorphic to Tt-{M) with an extra vertex 1 adjoined. The edges incident 
with 1 are as follows; for each generator x X, there is an edge from 1 
to xa labelled x, and of course a corresponding inverse edge from xa to 1 
labelled x. In particular, whenever there is an edge from 1 to a vertex v 
[respectively, w to 1], there is also an edge in both T^(M) and Tt(M) from 
e to V [v to e] with the same label. 
We prove first that 

L^(M) = Lr{M) n {XX*X U {e}). 

It is immediate from our description above of the edges incident with 1, 
that every non-empty label of a loop at 1 in Tcr{M) must begin with a 
positive generator and end with a negative generator, so that L^{M) C 
XX* X U {e}. Moreover, from our observation that every edge incident with 
1 corresponds to an edge incident with e in Tt-{M), it follows easily that every 
word accepted by rcr(M) is accepted by rT-(M), so that L(^(M) C L^(M). 

Conversely, suppose u G Lt{M) fl {XX*X U {e}). If u = e then certainly 
u G Lct(M), so we may assume that u £ XX*X, that is, u = xvy for 
some x,y £ X and v G X* . Let vr be a loop at e labelled u in the loop 
automaton Tt{M). Then tt begins with an edge leaving e with label x; from 
the definition of the loop automaton, it follows easily that the target of this 
edge is the vertex xa. Similarly, tt ends with an edge from ya to e labelled y. 
In between is a path from xa to ya labelled v. But T(j{M) has an edge from 
1 to xa labelled x, and an edge from ya to 1 labelled y. Moreover, since 
Va{M) contains r^(M), it also has a path from xa to ya labelled v. Thus, 
we obtain a path in V„{M) from 1 to 1 with label u, so that u G L„{M). 
This completes the proof of the first claim. 

Now let w G X~^ be any non-empty word such that wa = e. We claim 
that 

Lr{M) = w-'^L^{M)w-^ = {ueX* \ www G L^{M)}. 

To show one inclusion, suppose u G Lt{M), so that u G X* labels a loop 
at e in Tt-{M). Clearly T^{M) has a path from 1 to e labelled w and hence 
a path from e to 1 labelled w. Since T„{M) contains Tt-{M), it also has a 
loop at e labelled u. Thus, Tcr{M) has a loop at 1 labelled www, so that 
u G w~'^L„{M)w~'^. 

Conversely, suppose that u G w^^ La-{M)w^^ , that is, that wuw G La-(M), 
so that T^{M) has a loop at 1 labelled wuw. It follows easily from the 
definition of the loop automaton that there is a unique path beginning at 1 
with label w, and that this path ends at e. Dually, there is a unique path 
ending at 1 with label w, and this path begins at e. Hence, F^iM) must 
have a loop at e labelled u. Moreover, because of our observation at the 
start of the proof that every edge incident with 1 has a corresponding edge 
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incident with e, we can find such a loop which does not visit the vertex 1. 
It fohows that Tt-{M) also has a loop at e labelled u, so that u G Lt-{M), as 
required to complete the proof of the second claim. □ 

Since taking an intersection with a regular languages, left and right trans- 
lation, and taking an inverse image under a morphism are all operations 
which can be performed by rational transductions 3_, Propositions l4.6l and l4.7l 
yield the following result, which says that the loop problem of a semigroup or 
monoid is, modulo rational equivalence, invariant under choice of generators. 

Corollary 4.8. Let S be a semigroup or monoid. Then any two loop prob- 
lems for S (as a semigroup or as a monoid if appropriate) are rationally 
equivalent. 

In view of Corollary 14.81 we shall permit ourselves to speak simply of 
"the" loop problem of a monoid or semigroup where, as is often the case, 
our interest is in language-theoretic properties which are invariant under 
rational transductions. 

5. Groups and Completely Simple Semigroups 

In this section, we consider the loop problems of groups, and then more 
generally of completely simple semigroups. 

Let a : X* ^ G be a choice of monoid generators for a group G. Recall 
that the word problem for G with respect to a is the language of all words 
w G X* such that wa = 1 in G. Equivalently, it is the language accepted by 
the Cayley graph Tfj{G) when viewed as an automaton with initial and ter- 
minal state the identity of G. An obvious question concerns the relationship 
between the loop problem for a group (with respect to a particular finite 
choice of generators) and the word problem. The following proposition says 
that they are almost the same thing. 

Proposition 5.1. Let a : X* G be a choice of monoid generators for 
a group G. Then there exists a choice of monoid generators t : X* — > G 
such that the loop problem L„{G) of G with respect to a is equal to the word 
problem of G with respect to r. 

Proof. Define r : X* G hy xt = xa and xt = (xa)^^ for all x G X. It is 
readily verified that T(j{G) is exactly the same as Tr{G), so that L(j{G) is 
equal to the word problem of G with respect to r, as required. □ 

Since we are chiefly interested in intrinsic structural properties of groups 
(and semigroup and monoids) which are not dependent upon a particular 
choice of generators, this correspondence is quite sufficient for our purposes. 

Next, we consider the broader class of completely simple semigroups. Re- 
call that a primitive idempotent in a semigroup S is an idempotent e with 
the property that for any non-zero idempotent / such that ef = fe = /, 
we have e = f . K semigroup is called completely simple if it has a primitive 
idempotent and no proper ideals. For a detailed introduction to the theory 
of completely simple semigroups, including a number of equivalent defini- 
tions, see jni Chapter 2] or 16, Chapter 3]. A subgroup of a semigroup S is 
a subsemigroup of S which forms a group with the multiplication inherited 
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from S; a subgroup of S is maximal if it is not properly contained in any 
other subgroup. A construction of Rees j24j gives a simple combinatorial 
description of completely simple semigroups, in terms of their maximal sub- 
groups. Let G be a group and / and J be sets, and let P be a J x I matrix 
with entries drawn from G. The Rees matrix semigroup M{G; I, J; P) is 
the semigroup with set of elements I x G x J and multiplication given by 
{i,g,j){k,h,l) = {i, gPjkh,l) for all i,k G I, g,h G G and j,l G J. The 
following theorem is usually attributed to Rees, although the essential idea 
was given by Suschkewitz j27j . 

Theorem 5.2 (Suschkewitz 1928, Rees 1940). Let G be a group, I and 
J sets, and P a J x I matrix over G. Then the Rees matrix semigroup 
M{G; I, J; P) is completely simple with maximal subgroups all isomorphic 
to G. Conversely, every completely simple semigroup is isomorphic to one 
of this form. 

Both completely simple semigroups and Rees matrix constructions are of 
central importance in the structural theory of semigroups. We shall need 
the following elementary property of completely simple semigroups. 

Lemma 5.3. Let H be a maximal subgroup of a completely simple semigroup 
S. If a G and x,y £ H are such that ax S H, then there exists b £ H 
such that bx = ax and by = ay. 

Proof. To prove the claim when a = 1 we simply take b to be the identity of 
H. Otherwise, by the Rees theorem as described above, we may assume that 
S = M{G] /, J; P) for some group G, sets / and J and J xl sandwich matrix 
P. It is well known, and can easily be deduced from the Rees theorem, that 
there exist is/ and j £ J such that the maximal subgroup H is the set of 
all elements of the form {i,g,j) for different g £ G. 

Thus, we may suppose that a = {ia,9a,ja), x = {i,gx,j) and y = {i,gy,j). 
Set b = {i, gaPj^iPjl^ , j) € H. Now we have b £ H and 

ax = iia,ga,ja)ii,gx,j) = (ia, OaPjaidx, j) ■ 

On the other hand, 
bx = {i,gaPj^iPj'i'^,j){i,gx,j) = {i,gaPjaiPj'i^Pji9x,j) = {i,gaPjai9x,j)- 

Moreover, since we know that ax € H, we must have ia = i, from which it 
follows that ax = bx as required. An similar argument (but using the fact 
that we already know that ia = i in order to avoid the need to presuppose 
that ay G H) shows that ay = by, completing the proof. □ 

The Rees theorem often allows results about groups to be extended in 
some form to completely simple semigroups. The following two theorems say 
that the loop problem of a finitely generated completely simple semigroup 
is closely related to the loop problem (or equivalently the word problem) of 
its maximal subgroups. 

Theorem 5.4. Let G be a maximal subgroup of a completely simple semi- 
group S. Let a : ^ G be a choice of semigroup generators for G, and 
T : y+ ^ S a choice of generators for S, such that X C Y and a is the 
restriction of t to . Then Lfj[G) = Lr{S) fl X* . 
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Proof. The loop automaton Tcj(G) of G is clearly embedded in a natural way 
into the loop automaton r^(S'). It follows easily that any word accepted by 
the former is also accepted by the latter, so that L„{G) C Lr{S) n X*. 

Conversely, suppose w is a word in Lr{S)r\X* . Then Tr{S) = T^i (S^) has 
a loop at 1 labelled w. Write w = uoViUiV2 ■ ■ ■ Un-iv^ with each Uj, € X* . 
By the Zig Zag Lemma fLemma 14. there exist pQ, . . . ,pn € such that 
Po = Pn = ^ and Pi{uiT) = pi+i(uj+ir) for < i < n. We claim that the 
elements po; • ■ • iPn-i can all be chosen to lie in G^. Indeed, suppose we are 
given po; • • • iPn-i satisfying the above equations and not all lying in G^, and 
let k be minimal such that does not lie in G^. Certainly, 1 < k < n — 1. 
Now we have u^-it E G and Pk-i S G"^, so that Pk-iiuk-iT) = Pki^kT) 
lies in G. Since f /jT and UfcT also lie in G, it follows by Lemma 15.31 that 
there exists an element qk £ G with qkivkT) = Pk{vkT) = pk-i{uk-iT) 
and qk{ukT) = Pk{ukT) = Pk+iivk+iT). Hence, we can replace pk with 
qk to obtain a sequence po, . . . ,pn with strictly fewer elements outside G^. 
Continuing in the same vein, we eventually obtain po, . . . ,pn € G^ with the 
desired properties. 

Applying the Zig Zag Lemma again, we now see that Ta-{G) = T„i{G^) 
has a loop at 1 labelled w, so that w £ Lt-{G), as required. □ 

Theorem 5.5. Let S be a finitely generated completely simple semigroup 
with maximal subgroups isomorphic to a group G. Then the loop problem 
for S is the Kleene closure of a rational transduction of the word problem 
forG. 

Proof. By the Rees theorem, as described above, we may assume that S = 
M(G; I, J; P) where by the main theorem of / and J are finite. Let 
a : X* G he a finite choice of monoid generators for G, and r : M 
a finite choice of semigroup generators for 5. For each y G 1", suppose 
yr = iiy,gy,jy) and let Wy,Wy G X* be words representing gy,gy^ G G re- 
spectively. For each i £ I and j G J, let Wji, w'j^ G X* be words representing 
Pji,P~-^ G G respectively. 

We define a finite state transducer from X to Y with 

• vertex set (I x J) U {A, Z} where A and Z are new symbols; 

• initial state A; 

• terminal state Z; 

• for each generator y £Y, an edge from A to {iy,jy) labelled {wy,y); 

• for each generator y £Y, an edge from {iy,jy) to Z labelled {w'y,y); 

• for each generator y £Y and each A; G J, an edge from {iy,k) to 
(iyjy) labelled {wkiyWy,y); and 

• for each generator y £ Y and each k £ J, an edge from {iy,jy) to 
{iy,k) labelled labelled {w'yw'i^^^,y). 

Now let g £ G, i £ I, j £ J and v £ Y* and suppose n is a positive 
integer. We say that a path in the loop automaton Ft (5') which originates 
at 1 is non-returning if it does not visit the vertex 1 at any point other than 
the start and (possibly) the end. As a first step towards the proof, we claim 
that the following conditions are equivalent. 
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(i) the loop automaton r^(S') has a non-returning path of length n from 
1 to {i,g,j) labelled v; 

(ii) the transducer has a path of length n from A to labelled {u,v) 
for some u G X* which represents g. 

The proof of equivalence proceeds by induction on the path length n. That 
the equivalence holds when n = 1 follows immediately from the definition of 
the edges in the transducer. Now let n > 1 and suppose true for smaller n. 

Suppose first that (i) holds, and let vr be the path given by the hypothesis. 
Let e be the last edge the path tt, and let vr' be the path vr with the last 
edge removed, so that tt = ir'e. Let v' be the label of tt'. Since n > 1 and 
TT is non-returning, the path tt' must end at a vertex of the form {i',g',k). 
It follows easily from the definition of the multiplication in a Rccs matrix 
semigroup that the vertices in the loop automaton corresponding to elements 
with first coordinate i are connected to the rest of the automaton only via 
the vertex 1. Hence, since the path vr is non-returning, we must have i = i', 
so that TT actually ends at {i,g',k). Now tt is a path of length n — 1, so by 
the inductive hypothesis, the transducer has a path of length n — 1 from A 
to {i, k) labelled {u',v') for some word u' G X* which represents g' G G. 

We now treat separately the case where e is a positive edge, and that 
where e is a negative edge. Suppose first that e is a positive edge, with label 
y EY. Then from the definition, the transducer has an edge from {i, k) to 
with label {wkiyWy,y). Hence, the transducer has a path of length n 
from 1 to with label 

{u,v'){wkiyWy,y) = {u'wkiyWy,v'y) = {u'wkiyWy,v). 

Now from the definition of the loop automaton, we must have 

{i,g,j) = ihg',k)iyT) = ii,g',k){iy,gy,jy) = {i,g'Pkiygy,jy)- 

Equating second coordinates, we see that g = g'Pki gy But it follows that 
g is represented by the word u'wkiyWy G X*, so setting u = WkiyWy G X*, 
we see that (ii) holds as required. 

On the other hand, suppose e is a negative edge, with label y for some 
y € Y. In this case the transducer by definition has an edge from (i, k) to 
with label {w'yw'^^^,y) . Hence, there is a path of length n from 1 to 
with label 

{u\v'){w'yw',,i^,y) = iuwyw'f,i^,v'y) = iu'wy^w^l^,v). 
Now from the definition of the loop automaton, we must have 

{i,g',j) = {hg\k){yT) = {hg',k){iy,gy,jy) = {i,g'Pkiygy,jy)- 

Again equating second coordinates, we see this time that gPkiygy = g' , so 
that g = g'gy^Pj^^, and g is represented by the word u'wyw'f.-^. Now setting 
u = u'Wy^w'i^^^, we again see that (ii) holds as required. This completes the 
proof that (i) implies (ii). 

Conversely, suppose (ii) holds, and this time let tt be a path of length 
n in the transducer from A to (i,j) labelled {u,v) for some u G X* which 
represents g. Much as before, we let e be the last edge of tt and tt' be the 
path TT with the final edge removed. Then tt' is a path of length n — 1 from 
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A to some vertex (i' , k) with label of the form (u', v'). Moreover, it follows 
easily from the definition of the transducer that i = i', so that vr' ends at 
{i,k). Let g' e G he the element represented by u'. Then by the inductive 
hypothesis, there exists a path of length n — 1 in the loop automaton from 
1 to {i,g',k) with label v' . Now e is an edge from {i,k) to From the 

definition of the edges in the transducer, we see that there exists y with 
jy = j such that e has label either {wkiWy,y) or {Wyw'i^^^,y). As before, we 
treat these two cases separately. In the former case, observe that we have 
u = u'wkiyWy from which we deduce that g = g'Pkiygy But now 

{i,g',k){ya) = {i,g'Pijygy,jy) = {i,g,j) 

so we see that the loop automaton has an edge from {i,g',k) to {i,g,j) 
labelled y. Combining this with the path whose existence we deduced using 
the inductive hypothesis, we conclude that the loop automaton has a path 
from 1 to {i,g,j) labelled v = v'y, so that (i) holds as required. An entirely 
similar argument suffices to show that (i) also holds in the case that e has 
label of the form {w'yw'^,^^^y), thus completing the proof of the equivalence 
of conditions (i) and (ii) above. 

Now let K be the language of all words which label non-returning loops 
at the identity in ry(S'). We claim now that K is exactly the image of 
the word problem of G under the transduction defined by our transducer. 
Clearly the Kleene closure K* is exactly the loop problem Lt{S), so this 
will suffice to complete the proof of the theorem. 

Suppose first that v G K. Then by definition the loop automaton ry(S') 
has a non-returning loop at 1 labelled v. Note that all edges in ry(S') which 
end at 1 run from vertices corresponding to generators y and have label y, so 
we may assume that the last edge of the path runs from a generator y to 1, 
and has label y. Let tt be the path without this last edge, so that vr runs from 
1 to y, and let v' G Y* be the label of this path, so that v = v'y. Then by 
the equivalence above, the transducer has a path from A to {iy,jy) labelled 
(n, v') for some u G X* representing gy. But directly from the definition, the 
transducer also has an edge from {iy,jy) to Z labelled {Wy^,y). Hence, we 
deduce that {uWy^,v) is accepted by the transducer, where uw~^ represents 
9y9y^ = 1 in G, so that v lies in the image under the transduction of the 
word problem of G. 

Conversely, suppose u is a word representing 1 in G, such that the trans- 
ducer accepts (n, v). Then the transducer has a path tt from Ato Z labelled 
{u,v). Again, we proceed by letting vr' be the path obtained from tt by 
deleting the last edge. Then there must exist a generator y such that tt' 
ends at {iy,jy)- Moreover, tt' must be labelled (u',v') where u = u'w'y and 
V = v'y. Now since u represents 1 and w'y represents p"^, we deduce that u' 
represents gy. By the equivalence above, it follows that the loop automaton 
ri-(<S') has a non-returning path from 1 to {iy,gy,jy) labelled v'. Now it 
certainly also has an edge from {iy,gy,jy) to 1 labelled y, so we deduce that 
v = v'y e K, as required. This completes the proof. □ 
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We remark that the transducer constructed in the proof of Theorem 15.51 
can be construed as a G-automaton (18| accepting part of the loop problem 
for S. 

Combining Theorems l5.4l and l5.5l with Prop ositions 14 . 7l and 15 . 1 1 we obtain 
the following. 

Theorem 5.6. Let ^ he a family of languages closed under rational trans- 
duction and Kleene closure, and let S be a finitely generated completely 
simple semigroup. Then the following are equivalent 

(i) the loop problem for S belongs to ^ ; 

(ii) the loop problem for each maximal subgroup of S belongs to ^ ; 

(iii) the word problem for each maximal subgroup of S belongs to ^ . 

6. Structural-Linguistic Correspondences 

In this section, we exhibit some correspondences between the structural 
properties of a monoid, and the linguistic properties of its loop problem. As 
a first step, we obtain a generalisation of a foundational result of Anisimov 

from combinatorial group theory. 

Theorem 6.1. Let a : X* M be a finite choice of generators for a monoid 
M. Then M is finite if and only if Lu{M) is regular. The corresponding 
statement for semigroups also holds. 

Proof. If M is finite then it follows immediately from the definitions that 
the loop automaton Tu{M) is finite. Hence, L^{M) is recognised by a finite 
automaton, and so is regular. 

For the converse, recall that a (right) cone of a language L Q X is a 
language of the form 

w^^L = {x € X* I wx £ L} 

for some w € X* . The Myhill-Nerode Theorem |I5i Theorem 3.9] states that 
a language is regular if and only if it has finitely many distinct right cones. 
Now let u,v (z X* be words in the generators of M. Then by Corollary 14.21 
we have uv L if and only if u and v represent the same element of M. It 
follows that V € u'^L{M) if and only if u and v represent the same element 
of M, and hence that L{M) has a distinct cone for each element of M. So 
if M is infinite then L{M) has infinitely many cones, and so is not regular. 

The corresponding statement for semigroups follows from the result for 
monoids together with the fact that a semigroup S is finite if and only if 
is finite. □ 

We now turn our attention to the class of monoids whose loop problem is 
a context-free language (see jSj). In the group case, a well-known theorem 
of Muller and Schupp [HI ■, augmented by a subsequent result of Dunwoody 
[H], says that a finitely generated group has context-free word problem if 
and only if it is virtually free, that is, has a free subgroup of finite index. 
Despite the straightforward combinatorial nature of this statement, the only 
known proof depends essentially on deep geometric results about group Cay- 
ley graphs, and specifically on Stallings' theory of ends HH]- We pose the 
following question. 
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Question 6.2. Can one characterize or even classify the monoids and semi- 
groups with context-free loop problem? 

We suspect this question (in general) to be difficult, and believe tfiat 
a satisfactory answer and tlie techniques used to obtain it may be a sig- 
nificant development with implications reaching well beyond combinatorial 
semigroup theory. Firstly, such a result may lead to a purely combinatorial 
understanding of the Muller-Schupp Theorem, which has been much sought 
after by combinatorial group theorists. Secondly, it may require the devel- 
opment of new tools for showing that certain languages are not context-free; 
since there are a number of notable languages conjectured, but not proven, 
to be non-context-free, such techniques are likely to be of considerable in- 
terest to formal language theorists 

We hope that seeking answers in different classes of "well-behaved" semi- 
groups may give an indication of how one could proceed in the general case. 
In semigroup theory, of course, "well-behaved" typically means one of "com- 
binatorially straightforward", "in some way group-like" or perhaps "combi- 
natorially straightforward modulo group theory". One might expect that 
the question can be answered in the first case by elementary combinatorial 
means, and in the last by application of the Muller-Schupp theorem. As one 
example, recalling that the class of context-free languages is closed under 
rational transduction ^ Corollary 4.2] and Kleene closure JS, Theorem 2.1], 
we can combine the Muller-Schupp Theorem with Theorem 15.61 to obtain a 
complete description of completely simple semigroups with context-free loop 
problem. 

Theorem 6.3. Let S he a finitely generated completely simple semigroup. 
Then S has context-free loop problem if and only if its maximal subgroups 
are virtually free. 

To conclude this section, we discuss briefly the relationship between semi- 
groups with context-free loop problem, and recent attempts to define word 
hyperbolic semigroups. A choice of representatives for a semigroup S con- 
sists of a choice of generators a : X+ S together with a subset R C X+ 
such that Ra = S. The choice of representatives is called regular if X is 
a finite and i? is a regular language. The multiplication table for S with 
respect to a and R is the language {u#vf^w^ \ {uv)a = wa} where # is a 
new symbol not in X, and denotes the word w written backwards. 

An interesting recent result of Oilman JJ| is that a finitely generated 
group is word hyperbolic (in the sense of Oromov |^) if and only if it admits 
a regular choice of representatives with respect to which the multiplication 
table is a context-free language. Duncan and Oilman [Jj proposed that the 
latter condition might form a suitable basis for a theory of word hyperbolic 
semigroups. Virtually free groups — that is, groups with context-free word 
problem — form an elementary class of word hyperbolic groups. It transpires 
that a more general relationship holds between semigroups with context-free 
loop problem and word hyperbolic semigroups in the sense of Duncan and 
Oilman. 

Proposition 6.4. Let S he a semigroup with context-free loop problem. Let 
a : X^ S he any choice of generators for S. Then the multiplication 
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table of S with respect to a and is context-free. In particular, S is word 
hyperbolic in the sense of Duncan and Oilman [7j. 

Proof. Consider first the language 

Ki = {u#v#w \u,v,w £ X*,uvw G La-{S)} 

It is an easy exercise to verify that Ki is a rational transduction of Lo-(5'), 
and hence is context-free. Now by intersecting with a regular set we see that 
the language 

K2 = Ki n {X*#X*#X*) 

is also context-free. Finally, applying the substitution x — > x for all x G 
X gives the multiplication table we require, so the latter is also context- 
free, n 

7. Right Cancellative Monoids 

In this section, we study the loop problem in the case of right cancellative 
monoids. We begin with a proposition which tells us that, in this case, the 
loop automaton is even more closely related to the loop problem than in 
general. 

Proposition 7.1. Let a : X* M be a choice of generators for a right 
cancellative monoid M. Then the loop automaton T„{M) is the minimal 
automaton for the loop problem Lo-{M). 

Proof. By definition, Tfj{M) is an automaton accepting the loop problem of 
M. Clearly there is a path from the identity vertex to every vertex, and a 
path from each vertex to the identity vertex, so that the automaton is trim. 

For determinism, suppose x G X and p,q,r G M are such that there 
are edges from p to q and from p to r both labelled x. Then we have q = 
p{xa) = r. On the other hand, suppose there are edges from p to q and from 
p to r both labelled x. Then q{xa) = p = r{xa) so by right cancellativity 
we deduce again that q = r. Thus, the automaton is deterministic. 

For minimality, it suffices to show that no two states in the loop automa- 
ton are equivalent, that is, have the same cone. To this end, let p and q be 
vertices in the loop automaton, that is, elements of M, and choose a word 
u G X* such that ua = p. Then there is a path from 1 to p labelled u, but 
no path from 1 to q labelled u. But now there is an inverse path from p to 
1 labelled u, but no path from g to 1 labelled u. Hence, u lies in the cone of 
p but not in the cone of q. We deduce that all vertices have distinct cones, 
so that the automaton is minimal. □ 

Combining Proposition 17.11 with Proposition 12.21 we immediately obtain 
the following description of the syntactic monoid of the loop problem of a 
right cancellative monoid. 

Corollary 7.2. Let a : X* M be a choice of generators for a right 
cancellative monoid M. Then the transition monoid of the loop automaton 
T(y{M) is the syntactic monoid of the loop problem L(^{M). 

We now describe an interaction between the loop problem and the classical 
structural theory of right cancellative monoids. Let M be a right cancellative 
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monoid. For each element m S M, define the right translation map pm '■ 
M ^ M hy xp = xm for all x € M. Since the monoid is right cancellative, 
each map pm is injective. Its (relational) inverse is the map p'^ : Mm M 
well-defined by {xm)p = x for all x G M. Viewed as partial bijections on 
M, the maps of the form pm and p^ generate an inverse monoid, which is 
called the inverse hull of M. Inverse hulls, were introduced by Rees j25j . 
who used them to study embeddings of cancellative semigroups into groups. 
A detailed study can be found in [HI Sections 1.9 and 1. 10]. 

Now let T : X* — > M be a choice of generators for M. It is readily 
verified from the definitions that for each x € X, the right translation map 
PxT is exactly the same as the transition map cj^ in the loop automaton 
Tr{M). Similarly, its relational inverse p~^ is the transition map ax in 
the loop automaton. Thus, we see that the transition monoid of the loop 
automaton Tt-[M) is exactly the inverse hull of the right cancellative monoid 
M. Combining with Corollary 17.21 we obtain the following relationship 
between the loop problem and the inverse hull of a right cancellative monoid. 

Theorem 7.3. Let r : X* M he a choice of generators for a right 
cancellative monoid M . Then the syntactic monoid of the loop problem 
Lt-{M) is the inverse hull of M. (Moreover, its action by partial maps 
on the minimal automaton of Lr{M) coincides with its action by partial 
bijections on M .) 

Note that the inverse hull of a group is easily seen to be isomorphic to 
the group itself. Hence, in this case, we recover the "folklore" fact that a 
group is the syntactic monoid of its own word problem. 

In the case that M is right cancellative, we can describe another nice 
language-theoretic property of the loop problem. Recall that a language 
L is called deletion-closed if whenever x,y,w are (possibly empty) words 
such that U! G L and xwy € L we have xy £ L. Note that a non-empty 
deletion-closed language will always contain the empty word. 

Proposition 7.4. Any loop problem of a right cancellative monoid is deletion- 
closed. 

Proof. Let a : X* — > M be a choice of generators for a right cancellative 
monoid M, and suppose xwy,w G L„{M). Then the loop automaton has 
vertices p and q such that there is a path vri from 1 to p labelled x, a path 
7r2 from p to q labelled w and a path tt^ from g to 1 labelled y. There is 
also a loop at 1 labelled w, so by Lemma there is a loop 7r4 at p labelled 
w. Now the paths tt2 and ir^ both start at p and have label w, and by 
Proposition 17. II the loop automaton is deterministic, so 712 and must end 
at the same vertex, that is, p = q. But now ttitts is a loop at 1 labelled xy, 
so xy & L as required. □ 

Combining Propositions 14.51 and 17.41 we obtain the following corollary. 

Corollary 7.5. Every inverse hull of a right cancellative monoid is the 
syntactic monoid of an insertion-closed, deletion-closed language. 

Proposition 7.6. Let L C X* be a formal language. Then the following 
are equivalent: 
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(i) L is insertion-closed and deletion-closed; 

(ii) L is the language of words representing the identity in its syntactic 
monoid; 

(iii) L is the language of words representing the identity in some monoid. 

Proof. Suppose (i) holds. Observe that the map w i-^ [w]l is a choice of 
generators for the syntactic monoid S{L). Now the fact that L is insertion- 
closed and deletion-closed means precisely that for any w £ L and any 
x,y £ X*, we have xwy G L if and only if xy € L. But this is true exactly 
if w is syntactically equivalent to the empty word, that is, if [w]l is the 
identity element in S{L). Hence, (ii) holds. 

That (ii) implies (iii) is immediate, so suppose now that (iii) holds, that 
is, that L is the language of words representing the identity in some monoid 
M generated by X. Clearly if xy and w both represent the identity in M 
then xwy also represents the identity, so L is insertion-closed. Conversely, 
if xwy and w represent the identity then xy represents the identity, so L is 
deletion-closed. Thus, (i) holds. □ 

Combining Theorem 17.31 with Propositions 17.41 and 17.61 we obtain the 
following additional description of the loop problem in a right cancellative 
monoid. 

Corollary 7.7. Let M be a right cancellative monoid. Then the loop prob- 
lem of M is the language of words representing the identity in the inverse 
hull of M. 

An application of the connections developed in this section to the study of 
a class of inverse monoids will form part of a forthcoming paper of J. B. Foun- 
tain and the author |9j. 
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